Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

INFTY: An integrated OCR system for mathematical documents

Identifieur interne : 001790 ( Main/Exploration ); précédent : 001789; suivant : 001791

INFTY: An integrated OCR system for mathematical documents

Auteurs : Masakazu Suzuki (mathématicien) [Japon] ; Fumikazu Tamari [Japon] ; Ryoji Fukuda [Japon] ; Seiichi Uchida [Japon] ; Toshihiro Kanahori [Japon]

Source :

RBID : Pascal:05-0039231

Descripteurs français

English descriptors

Abstract

An integrated OCR (Optical Character Recognition) system for mathematical documents, called INFTY, is presented. INFTY consists of four procedures, i.e., layout analysis, character recognition, structure analysis of mathematical expressions, and manual error correction. In those procedures, several novel techniques are utilized for better recognition performance. Experimental results on about 500 pages of mathematical documents showed high character recognition rates on both mathematical expressions and ordinary texts, and sufficient performance on the structure analysis of the mathematical expressions.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">INFTY: An integrated OCR system for mathematical documents</title>
<author>
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Faculty of Mathematics, Kyushu University</s1>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Tamari, Fumikazu" sort="Tamari, Fumikazu" uniqKey="Tamari F" first="Fumikazu" last="Tamari">Fumikazu Tamari</name>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>Department of Information Education, Fukuoka University of Education</s1>
<s3>JPN</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Department of Information Education, Fukuoka University of Education</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Fukuda, Ryoji" sort="Fukuda, Ryoji" uniqKey="Fukuda R" first="Ryoji" last="Fukuda">Ryoji Fukuda</name>
<affiliation wicri:level="1">
<inist:fA14 i1="03">
<s1>Department of Human Welfare Engineering, Oita University</s1>
<s3>JPN</s3>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Department of Human Welfare Engineering, Oita University</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
<affiliation wicri:level="4">
<inist:fA14 i1="04">
<s1>Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s3>JPN</s3>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Kanahori, Toshihiro" sort="Kanahori, Toshihiro" uniqKey="Kanahori T" first="Toshihiro" last="Kanahori">Toshihiro Kanahori</name>
<affiliation wicri:level="1">
<inist:fA14 i1="05">
<s1>Research Center on Educational Media, Tsukuba College of Technology</s1>
<s3>JPN</s3>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Research Center on Educational Media, Tsukuba College of Technology</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">05-0039231</idno>
<date when="2003">2003</date>
<idno type="stanalyst">PASCAL 05-0039231 INIST</idno>
<idno type="RBID">Pascal:05-0039231</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000494</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000295</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000554</idno>
<idno type="wicri:Area/Main/Merge">001868</idno>
<idno type="wicri:Area/Main/Curation">001790</idno>
<idno type="wicri:Area/Main/Exploration">001790</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">INFTY: An integrated OCR system for mathematical documents</title>
<author>
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
<affiliation wicri:level="4">
<inist:fA14 i1="01">
<s1>Faculty of Mathematics, Kyushu University</s1>
<s3>JPN</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university" n="3">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Tamari, Fumikazu" sort="Tamari, Fumikazu" uniqKey="Tamari F" first="Fumikazu" last="Tamari">Fumikazu Tamari</name>
<affiliation wicri:level="1">
<inist:fA14 i1="02">
<s1>Department of Information Education, Fukuoka University of Education</s1>
<s3>JPN</s3>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Department of Information Education, Fukuoka University of Education</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Fukuda, Ryoji" sort="Fukuda, Ryoji" uniqKey="Fukuda R" first="Ryoji" last="Fukuda">Ryoji Fukuda</name>
<affiliation wicri:level="1">
<inist:fA14 i1="03">
<s1>Department of Human Welfare Engineering, Oita University</s1>
<s3>JPN</s3>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Department of Human Welfare Engineering, Oita University</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
<affiliation wicri:level="4">
<inist:fA14 i1="04">
<s1>Faculty of Information Science and Electrical Engineering, Kyushu University</s1>
<s3>JPN</s3>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<placeName>
<settlement type="city">Fukuoka</settlement>
<region type="province">Kyūshū</region>
<region type="prefecture">Préfecture de Fukuoka</region>
</placeName>
<orgName type="university">Université de Kyūshū</orgName>
</affiliation>
</author>
<author>
<name sortKey="Kanahori, Toshihiro" sort="Kanahori, Toshihiro" uniqKey="Kanahori T" first="Toshihiro" last="Kanahori">Toshihiro Kanahori</name>
<affiliation wicri:level="1">
<inist:fA14 i1="05">
<s1>Research Center on Educational Media, Tsukuba College of Technology</s1>
<s3>JPN</s3>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Japon</country>
<wicri:noRegion>Research Center on Educational Media, Tsukuba College of Technology</wicri:noRegion>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithm</term>
<term>Integrated system</term>
<term>Mathematical formula</term>
<term>Mathematics</term>
<term>Optical character recognition</term>
<term>Performance evaluation</term>
<term>Structural analysis</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Reconnaissance optique caractère</term>
<term>Mathématiques</term>
<term>Formule mathématique</term>
<term>Système intégré</term>
<term>Analyse structurale</term>
<term>Algorithme</term>
<term>Evaluation performance</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Mathématiques</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">An integrated OCR (Optical Character Recognition) system for mathematical documents, called INFTY, is presented. INFTY consists of four procedures, i.e., layout analysis, character recognition, structure analysis of mathematical expressions, and manual error correction. In those procedures, several novel techniques are utilized for better recognition performance. Experimental results on about 500 pages of mathematical documents showed high character recognition rates on both mathematical expressions and ordinary texts, and sufficient performance on the structure analysis of the mathematical expressions.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Japon</li>
</country>
<region>
<li>Kyūshū</li>
<li>Préfecture de Fukuoka</li>
</region>
<settlement>
<li>Fukuoka</li>
</settlement>
<orgName>
<li>Université de Kyūshū</li>
</orgName>
</list>
<tree>
<country name="Japon">
<region name="Kyūshū">
<name sortKey="Suzuki, Masakazu" sort="Suzuki, Masakazu" uniqKey="Suzuki M" first="Masakazu" last="Suzuki">Masakazu Suzuki (mathématicien)</name>
</region>
<name sortKey="Fukuda, Ryoji" sort="Fukuda, Ryoji" uniqKey="Fukuda R" first="Ryoji" last="Fukuda">Ryoji Fukuda</name>
<name sortKey="Kanahori, Toshihiro" sort="Kanahori, Toshihiro" uniqKey="Kanahori T" first="Toshihiro" last="Kanahori">Toshihiro Kanahori</name>
<name sortKey="Tamari, Fumikazu" sort="Tamari, Fumikazu" uniqKey="Tamari F" first="Fumikazu" last="Tamari">Fumikazu Tamari</name>
<name sortKey="Uchida, Seiichi" sort="Uchida, Seiichi" uniqKey="Uchida S" first="Seiichi" last="Uchida">Seiichi Uchida</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001790 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001790 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:05-0039231
   |texte=   INFTY: An integrated OCR system for mathematical documents
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024